Regular Expression Filtering on Multiple <i>q</i>-Grams
نویسندگان
چکیده
منابع مشابه
Better Filtering with Gapped q-Grams
A popular and well-studied class of filters for approximate string matching compares substrings of length q, the q-grams, in the pattern and the text to identify text areas that contain potential matches. A generalization of the method that uses gapped q-grams instead of contiguous substrings is mentioned a few times in literature but has never been analyzed in any depth. In this paper, we repo...
متن کاملWords vs. Character N-grams for Anti-spam Filtering
The increasing number of unsolicited e-mail messages (spam) reveals the need for the development of reliable anti-spam filters. The vast majority of content-based techniques rely on word-based representation of messages. Such approaches require reliable tokenizers for detecting the token boundaries. As a consequence, a common practice of spammers is to attempt to confuse tokenizers using unexpe...
متن کاملMultiple Views on Filtering: Consumers, Producers and Filtering Technology Designers
information filtering We discuss the implications of information consumer and producer needs for the provider of basic filtering technology. To understand the consumer point of view, we outline seven different filtering goals that cover a wide variety of filtering questions, including obtaining an overview, identifying a trend, and finding a match between a prototype and an instantiation. We di...
متن کاملRegular Expression Acceleration at Multiple Tens of Gb/s
The frequency of network attacks increases every year, and at the same time the attack methods are becoming more sophisticated. To keep up with these trends, signature-based NIDSs such as Snort [1] apply more powerful and flexible content-filtering rules, often involving pattern conditions defined by regular expressions. This has triggered a substantial amount of research and product developmen...
متن کاملNon-regular processes and singular Kalman filtering
Contrary to the continuous-time case, a discrete-time process y can be represented by minimal linear models (see (1.1) below), which may either have a non-singular or a singular D matrix. In fact, models with D = 0 have been commonly used in the statistical literature. On the other hand, for models with a singular D matrix the Riccati difference equation of Kalman filtering involves in general ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2018
ISSN: 0916-8532,1745-1361
DOI: 10.1587/transinf.2017edl8180